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ABSTRACT 

The number of z ~ 1 damped Lyman alpha systems (DLAs, log N(K I) ^ 20.3) per 
unit redshift is approximately 0.1, making them relatively rare objects. Large, blind 
QSO surveys for low redshift DLAs are therefore an expensive prospect for space-borne 
UV telescopes. Increasing the efficiency of these surveys by pre-selecting DLA candi- 
dates based on the equivalent widths of metal absorption lines has previously been 
a successful strategy. However, the success rate of DLA identification is still only ~ 
35% when simple equivalent width cut-offs are applied, the majority of systems having 
19.0 < log N(H I) < 20.3. Here we propose a new way to pre-select DLA candidates. 
Our technique requires high-to-moderate resolution spectroscopy of the Mg II A 2796 
transition, which is easily accessible from the ground for 0.2 < z < 2.4. We define 
the D-index, the ratio of the line's equivalent width to velocity spread and measure 
this quantity for 19 DLAs and 8 sub-DLAs in archival spectra obtained with echelle 
spectrographs. For the majority of absorbers, there is a clear distinction between the 
D-index of DLAs compared with sub-DLAs (Kolmogorov-Smirnov probability = 0.8 
%). Based on this pilot data sample, we find that the D-index can select DLAs with a 
success rate of up to 90%, an increase in selection efficiency by a factor of 2.5 compared 
with a simple equivalent width cut. We test the applicability of the D-index at lower 
resolution and find that it remains a good discriminant of DLAs for FWHM < 1.5 A. 
However, the recommended D-index cut-off between DLAs and sub-DLAs decreases 
with poorer resolution and we tabulate the appropriate D-index values that should be 
used with spectra of different resolutions. 
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1 INTRODUCTION 



the low redshift universe has been challenging to survey. 
The Lya line is shifted into the UV at z < 1.5 where 
space telescopes are required. Given the limited resources, 
large-scale, blind surveys for low redshift DLAs are expen- 
sive and largely unpalatable to time assignment commit- 
tees. For this reason, pre-selection of low z DLA candidates 
based on low resolution, ground-based spectra has become 
an important first step in UV surveys (Rao & Turnshek 
2000; Rao, Turnshek & Nestor 2006). The equivalent widths 
(EWs) of resonance lines with red rest wavelengths, such as 
Mg II AA 2796, 2803 and Fe II A 2600, are easily measured 
from the ground in systems with N(K I) > 10 19 cm -2 and 
0.2 < z < 2.4. Rao et al. (2006) showed that ~ 35% of ab- 
sorbers with rest frame EW(Mg II A 2796, Fe II A 2600) 
> 0.5 A have column densities above the canonical DLA 
limit, iV(H I)> 2 x 10 20 cm" 2 . Although absorbers below 
this limit (sub-DLAs) make a non-negligible contribution to 
the total neutral gas content of the universe at high redshifts 



The compilation of large spectroscopic datasets, such as the 
Sloan Digital Sky Survey (SDSS), has proven a major boon 
for the study of QSO absorption lines. These surveys have 
yielded many hundreds of damped Lyman alpha systems 
(DLAs, absorption systems with A r (H I) ^ 2 x 10 20 cm" 2 ) 
which can be used to study the evolution of the galactic 
neutral gas content at z > 1.6 (e.g. Prochaska, Herbert- 
Fort & Wolfe 2005). In the local universe, several wide field 
21cm surveys have characterised the z = distribution of 
neutral hydrogen gas (e.g. Zwaan et al. 2005; Rosenberg 
& Schneider 2003; Ryan- Weber, Webster & Staveley-Smith 
2003). Bridging these extremes, over a redshift range that is 
emerging as a crucial time in galaxy formation, is a high 
priority for absorption line studies. In practice, however, 
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(Peroux et al. 2005; Prochaska et al. 2005), they are usually 
excluded from statistical samples. Therefore, although EW 
pre-selection has been a pivotal strategem of recent Hubble 
Space Telescope (HST) surveys, the 35% 'success rate' of 
metal line pre-selection could be considered low. Moreover, 
whilst a confirmation probability can be applied in a sta- 
tistical sense, there is no way of knowing which individual 
Mg II absorber is most likely to be a DLA, since there is no 
correlation between Mg II EW and iV(H I). For example, a 
DLA confirmation rate of 35% applied to the 0.6 < z < 1.7 
Mg II system survey of Ellison et al. (2004a) yields a DLA 
number density of n{z) = O.lll^g (Ellison 2005). However, 
UV follow-up of the 14 high EW Mg II systems in the El- 
lison et al. sample is still required in order to confirm the 
identity of the expected 5 DLAs. 

In this paper, we investigate whether ground-based 
spectra of metal lines at high resolution can provide addi- 
tional information to help discriminate between DLAs and 
sub-DLAs. Our objective is to identify a quantitative mea- 
sure that will efficiently separate DLAs and sub-DLAs with a 
success rate >> 35%. If the inclusion of sub-DLAs (i.e. false 
positives) can be minimised, the obvious benefit will be that 
UV follow-up for DLA confirmation and iV(H I) determina- 
tion will be far more efficient. Moreover, if the pre-selection 
is complete for high iV(H I) DLAs, any quantity weighted 
by 7V(H I), such as the total neutral gas content, will be 
robust, even if a few lower iV(H I) systems are missed in 
the pre-selection process. Finally, a high success rate DLA 
discriminant could be applied to surveys of Mg II absorp- 
tion systems to directly infer the DLA number density to 
a relatively high degree of accuracy, with no need for UV 
observations. 



2 DATA 

In order to investigate the properties of DLAs and sub- 
DLAs, we compile a sample of absorbers with known iV(H I) 
column densities and echelle spectroscopy which covers the 
Mg II transition. In practice, the high resolution spec- 
troscopy was obtained with either HIRES on Keck or UVES 
on the Very Large Telescope (VLT), see Table Q for refer- 
ences. In the final column of Table^we give the FWHM res- 
olution of the spectrum at the wavelength of the redshifted 
Mg II transition. These spectra were generously provided 
by a number of individuals (see Acknowledgements) with 
the continua already normalized. We return to the impact 
of continuum fitting in the next section. 

The first DLA surveys relied on an equivalent width se- 
lection based on relatively low (~ 10 A FWHM) spectra. 
This selection was therefore susceptible to blending of ab- 
sorbers below the DLA JV(H I) limit. As shown by Ellison 
(2000), for EW < 10 A, the fraction of true DLAs is << 
50%. We therefore only select absorbers from the literature 
whose Af(H I) is based on spectra with a FWHM resolution 
of at least 5 A. In most cases, this is sufficient to adequately 
fit damping wings with errors typically ± 0.1 dex (e.g. El- 
lison et al 2001; Rao, Turnshek & Nestor 2006). In Tabled 
we present the QSOs used in this study, as well as references 
to the iV(H I) column densities and metal abundances which 
have been normalised to the solar (meteoritic) scale of Lod- 
ders (2003). We have assumed that ionization corrections to 



Zn II and Fe II are negligible, which has largely been found 
to be the case even in lower column density systems (Peroux 
et al. 2003). 



In addition to the referenced archival data, we present 
previously unpublished data for the z a b s = 0.900 sub-DLA 
towards Q1629+120 whose N(H I) was determined by Rao 
et al. (2006). The echelle data were obtained as part of ESO 
program 69.A-0410(A) (PI, Athreya) on the UVES spectro- 
graph on UT2 at the VLT. A total of 12,000 seconds of data 
were obtained in service mode between April and May 2002 
using Dicl 346-580, yielding a wavelength coverage of 3100- 
3870, 4800-5750, 5850-6800 A. Data reduction was executed 
using the MIDAS routines of the UVES pipeline as described 
in full by Ellison, Ryan & Prochaska (2001). We fitted the 
continuum using a 3rd order Chebyshev polynomial based 
on absorption-free regions ~ 10 A wide either side of the 
Mg II absorption. The final 'archival' sample used in this 
study therefore consists of 19 DLAs and 8 sub-DLAs which 
have all been observed with either UVES or HIRES and 
therefore have typical FWHM resolutions of 6 - 8 km s - . 



3 THE D-INDEX 

From the high resolution data, we measure the rest frame 
EWs and velocity spread of the Mg II A 2796 absorption 
line. In calculating these quantities, we exclude 'detached' 
absorption components where the continuum is recovered 
and only include the complex with the largest EW. We set 
Amin.max as the limits of absorption that are significant at 
the 3 a level below the continuum, see Figure Q In this 
study we restrict ourselves to the Mg II A 2796 line since we 
are working with archival data with restricted wavelength 
access. Nonetheless, Mg II is a useful line since it has a high 
oscillator strength (and is therefore easy to detect) and is 
observable for most low-to-intermediate redshift DLAs. 

In Figure^ we demonstrate the well known scatter be- 
tween N(R I) and Mg II EW. In our sample, the highest 
EWs are exhibited by absorbers on the cusp of DLA classi- 
fication. Similarly, moderate EWs ~ 1 A can be associated 
with absorbers whose iV(H I) column density ranges over 
two orders of magnitude. It has been previously noted (e.g. 
Nestor et al. 2003) that the EW of Mg II A 2796 is more 
an indication of velocity spread than it is of 7V(H I). This is 
the reason that a simple EW cut-off applied to low resolu- 
tion spectra contains such a high fraction of sub-DLAs. An 
EW cut is unable to distinguish between a few highly satu- 
rated Mg II absorption components and many lower column 
density clouds; the distinction between these two is shown 
in Figure Q Qualitatively, DLAs tend to be characterised 
by the former, with wide, saturated absorption troughs and 
sub-DLAs by the latter. 

In the upper panel of Figure [3] we plot the EW (A) of 
Mg II A 2796 versus its velocity spread (Av, km/s). Within 
the bounds of A m i n and A max , which define the velocity 
spread, it is simple to show that there is a maximum EW. 
For this reason, there is a sharp cut-off in EW in the upper 
panel of Figure |3 where the shaded area shows the locus of 
forbidden values. In the lower panel of Figure [3]tlie data are 
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Table 1. Archival QSO Data 



QSO 


^abs 


N(U I) 


[Zn/H] 


[Fe/H] 


S/N 


D 


Refs (7V(H I),[X/H]) 


Resolution (A) 


000584-01 q 


0.613 


on 0.8+0 20 


+0.10 


—0.31 


65 


8.5 


1.1 


0.10 


O01 004-1 ^o 


2.309 


z,±,0 1 IU.UO 


— 1.53 


— 1.76 


35 


7 4 


2 2 


0.20 




0.931 


on AQ-\-C\ 08 




— 1.49 


60 


6 2 


3 3 


0.17 




0.931 


on A7-I-0 08 




> —1.29 


45 


6 5 


3 3 


0.17 


00827+24 


0.525 


on 3+0 0o 


< —0.04 


— 1.02 


20 


8.5 


4 5 


0.08 


O0841 +1 29 


2.375 


on qq+n 08 


— 1.52 


— 1.70 


17 


8.7 


6.6 


0.21 


00957+561 A 


1.391 


on '3+n. 1 


< —0.75 


— 1.31 


30 


7.6 


7.7 


0.12 


00957+561 r 


1.391 


i q qn+n i n 


< —0.31 


— 1.03 


25 


8.2 


7.7 


0.12 


01 1 (11 — 264 


1.838 




< —0.56 


— 1.46 


40 


5.8 


8.8 


0.17 


01 i nzL 1 80 


1.662 


on Q^-Un 01 


— 1.00 


— 1.55 


40 


9 


9 9 


0.15 


01 1 99— 1 fi8 


0.682 


9f| A^i-I-O O^i 


< —1.32 


— 1.32 


45 


8 2 


10 10 


0.11 


01 1 'SI -+-0fS8 

_1_ J_ ;J _L ^^U<JO 


1.774 


91 30+0 08 






15 


8.6 


11 


0.18 




1.944 


91 m+o i o 


— 1.24 


— 1.61 


30 


8 2 


6 6 


0.23 


01 21 O-r-1 73 


1.892 


20 fiQ+o 08 


—0.86 


— 1.09 


20 


6.9 


6.6 


0.17 


Q1223+175 


2.466 


21.44±0.08 


-1.68 


-1.69 


15 


8.4 


11,11 


0.23 


Q1247+267 


1.223 


19.88±0.10 


-1.02 


-1.37 


75 


5.0 


12,12 


0.12 


Q1331+170 


1.776 


21.14±0.08 


-1.23 


-1.99 


45 


6.5 


2,2 


0.18 


Q1351+318 


1.149 


20.23±0.10 


-0.27 


-0.96 


30 


6.3 


12,12 


0.12 


Q1451+123 


2.255 


20.30±0.15 


-1.08 


-1.44 


6 


8.3 


8,8 


0.16 


Q1622+238 


0.656 


20.4+0.1 






27 


8.5 


4 


0.12 


Q1629+120 


0.900 


19.70±0.04 


-0.18 


-0.86 


33 


5.7 


13,14 


0.12 


Q2128-123 


0.430 


19.37±0.08 




> -0.74 


28 


5.8 


10,10 


0.09 


Q2206-199 


1.920 


20.44±0.08 




-2.54 


15 


8.1 


15,15 




Q2230+023 


1.859 


20.00±0.10 






20 


5.4 


6 


0.17 


Q2231-001 


2.066 


20.53±0.08 


-0.86 


-1.18 


40 


7.8 


2.2 


0.20 


Q2343+125 


2.431 


20.35±0.05 


-0.74 


-1.26 


15 


8.2 


2.2 


0.21 


Q2348-144 


2.279 


20.59±0.08 


< -1.94 


-2.22 


35 


9.2 


6.6 


0.20 



Upper limits are 3<r. 1 - Pettini et al. (2000); 2 - Dessauges-Zavadsky et al. (2004); 3 - Lopez et al. (2005); 4 - Rao & Turnshek (2000); 
5 - Kulkarni et al. (2005); 6 - Dessauges-Zavadsky et al. (2006); 7 - Churchill et al. (2003); 8 - Dessauges-Zavadsky et al. (2003); 9 - 
Lopez et al (1999); 10 - Ledoux, Bergeron & Petitjean (2002); 11 - Dessauges-Zavadsky, private communication; 12 - Pettini et al. 
(1999); 13 - Rao, Turnshek & Nestor (2006); 14 - This work; 15 - Pettini et al. (2002); 

EW are likely to dominate over systematics such as the con- 
tinuum fit. 

In Table we list the D- indices for the archival sam- 
ple. The high D-index absorbers are those that are closest 
to the shaded area in the upper panel of Figure |3] whilst 
the low D-index absorbers have lower EWs for their Av. 
Returning to the example absorbers shown in Figure^gives 
a visual impression of how the D-index relates to spectral 
morphology. 

There is a clear tendency for DLAs to have higher IP- 
indices than sub-DLAs (lower panel, Figure |3J . Although 
there is not a clear dividing line between the populations, a 
D-index cut-off of 6.3 includes all but 1 of the 19 DLAs 
(95%). We consider the fraction of DLAs above the D- 
index cut-off to be the 'success-rate' of this pre-selection; 
for D = 6.3 that rate is 90% in our archival sample of 27 
absorbers. A Kolmogorov-Smirnov (KS) test applied to the 
D-indices shows that the DLAs and sub-DLAs have only a 
0.8% probability of being drawn from the same population. 
In the limited sample available to us, we also note that all 
DLAs with log N(H I) > 20.5 are included by a D > 6.3 cri- 
terion. We experimented with including metallicity informa- 
tion into the D- index, both the [Zn/H] abundance which is 
generally the preferred metallicity indicator, but also [Fe/H]. 
The abundance values used in our testing are given in Table 
We found that including abundance (absolute or rela- 
tive) information did not improve the observed distinction 
between DLAs and sub-DLAs. Although it is desirable to 



plotted as a function of A?"(H I) and we combine the velocity 
and EW information in a single parameter, D, where 

° = , EW J k \ >< 1000 (D 
Av(km/s) v ' 

The error in the EW (8 EW) is calculated from the 1 
a spectral error arrays on a pixel-by-pixel basis and prop- 
agated across the velocity width, Aw, of the line. Since we 
have explicitly defined Av, the error in the D-index (8 D) is 
simply calculated from 

8_D = SEW 

D EW 1 ' 

The errors do not include systematic uncertainties as- 
sociated with, for example, continuum fitting. The effect of 
continuum fitting should be negligible in the determination 
of the D-index for several reasons. First, the Mg II transition 
usually lies well redwards of the Lya forest in a spectral re- 
gion where the continuum is relatively flat and unabsorbed. 
Continuum fitting in these regions is usually straightfor- 
ward. Second, since the majority of the Mg II lines are 
heavily saturated, they will be relatively insensitive to er- 
rors in the continuum fit. We experimented by re-fitting the 
continuum of Q1629+120 with 5 different combinations of 
polynomial and spectral region. Re-calculating the D-index 
shows errors ^ 1%, i.e. smaller than the random errors de- 
rived from Equation 151 We conclude that random errors in 
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Figure 1. Examples of Mg II line profiles for low (upper and 
middle panels) and high (lower panel) D-indices. Qualitatively, a 
low D-index means more residual flux between the limits of the 
velocity spread (as shown by the vertical dashed lines). 
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Figure 2. The rest frame equivalent widths (EW) versus the 
iV(H I) of the archival absorbers listed in Table HI There is no 
correlation between column density and Mg II EW. 
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Figure 3. Upper panel: The EW of the Mg II A 2796 tran- 
sition versus the velocity spread, At!. The shaded region rep- 
resents forbidden values of EW for a given Av. Lower panel: 
D = EW/Av X 1000 as a function of 7V(H I). The horizontal 
dashed line shows a D = 6.3 which most effectively distinguishes 
the DLAs from lower column density absorbers. 



populate this parameter space with many more systems, the 
D-index appears to be a promising method for pre-selecting 
DLAs if high resolution data are available. 



4 DATA REQUIREMENTS 
4.1 Resolution 

We now consider the resolution requirements for the appli- 
cation of the D-index. We test this by convolving each of 
the spectra used in this study with a Gaussian of steadily 
increasing FWHM (applicable at the wavelength of Mg II A 
2796) to simulate the effect of decreasing spectral resolution. 
We re-calculate the effective minimum and maximum wave- 
lengths (which are used in the determination of the EW and 
Av) taking into account the smoothing of the data on the 
S/N ratio. The D-index of each convolved absorber and the 
KS probability of the two populations are then re-calculated. 

We find that the DLA and sub-DLA populations are 
well separated for FWHM < 1.5 A (at the wavelength of red- 
shifted Mg II A 2796), as shown by the examples in Figure[I] 
However, the appropriate cut-off in D-index decreases with 
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decreasing resolution. This is expected given that the EW of 
the absorption line is conserved during the convolution, but 
the velocity spread increases. Lowering the resolution affects 
the D-index of individual absorbers differently, depending 
on their kinematic profiles. For example, the success of the 
D-index in our tests actually improves for FWHM = 0.4 A, 
but this is just an artefact of the particular absorbers in 
our sample. The results of our convolution test highlight the 
need for a larger sample to statistically test the effect of reso- 
lution on the D-index cut-off. Based on our current data, the 
recommended values of D as a function of the resolution, as 
well as the KS probabilities, are given in Table |2] The frac- 
tion of absorbers above the recommended D cut-off which 
are DLAs is given in the column labelled 'Success rate'. The 
fraction of DLAs whose D-index is below the recommended 
cut-off is given in the column labelled 'Missed DLAs'. At 
resolutions below f.5 A it is still possible to set a D-index 
that has a high success rate, but the fraction of missed DLAs 
exceeds 15% by FWHM ~ 1.8 A and the KS probability in- 
dicates that the D-indices of the DLAs and sub-DLAs are 
no longer significantly different. Nonetheless, a resolution of 
1.5 A is within the reach of many existing spectrographs, 
such as ISIS (William Herschel Telescope), EFOSC (ESO 
3.6-m telescope), FORS (VLT) and ESI (Keck), as well as 
the very high resolution instruments used for this study. Un- 
fortunately, however, based on the resolution tests and the 
current data, the SDSS data have slightly too low resolution 
for the D-index to be successfully applied. 



FWHM = 0.4 



As a check of the convolution tests, we measured the 
D-indices of five absorbers with known 7V(H I) and exist- 
ing ESI spectra provided by J. Prochaska. The absorbers 
are those towards SDSS1027+1405 (z aba = 2.441, 7V(H I) 
= 20.30), SDSS2036-0555 (2 abs = 2.280, N(R I) = 21.20), 
SDSS1042+0117 (zabs = 2.267, 7V(H I) = 20.75), SDSS2059- 
0529 (zabs = 2.210, N(H I) = 20.90) and Q1337+11 (z abs = 
2.142, iV(H I) = 20.1). The data were obtained with a 0.5 
arcsec slit and have a spectral resolution ~ 1 A, so we adopt 
a D-index cut-off of 4.9 from Tabled We plot the D-indices 
versus iV(H I) in Figure^ where we see that the DLAs (4/5 
absorbers) all have values above the cut-off value whereas 
the one sub-DLA has D < 4.9. Although more extensive 
testing is required, these initial results from lower resolution 
spectra are encouraging. 

Finally, since resolution has a significant impact on the 
measured D-index, we have checked that the distribution of 
values in Figure [3] is not due to observing coincidence. We 
checked that both the distribution of absorber redshifts and 
spectral resolutions are random with respect to D-index. 
Although the spectral resolutions listed in Tabled do show 
some slight variation, there is no anti-correlation between D- 
index and resolution. In fact, the handful of absorbers with 
FWHM > 0.2 A all have high D-indices. The D-index of 
DLAs is high for all resolutions represented by the archival 
sample, ranging from 0.11 - 0.23 A. This is probably due to 
small numbers since a lower resolution will lead to a lower 
D-index (see Figure 2J. 
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Figure 4. Each of the spectra in our archival sample is convolved 
with a Gaussian of FWHM shown above each panel (see Table 
[5] for the results of the full test). Poorer resolution decreases the 
recommended D-index cut-off for distinction between DLAs and 
sub-DLAs. For FWHM > 1.5 the D-index no longer provides 
a robust discriminant of the two populations. The dotted lines 
show the vector shift between the original D-index (unconvolved 
spectrum) and the convolved data. 



4.2 Signal-to-Noise 

We can do a similar test for the requisite signal-to-noise ra- 
tio (S/N). For two of the absorbers in our sample (one with 
a low D-index, Q 1247+267, and one with a high D-index, 
Q0058+019) we produce a noiseless spectrum from a Voigt 
profile decomposition of the Mg II A 2796 line. Many of the 
components are saturated, so this 'fit' does not yield accu- 
rate column densities and may be degenerate in the absence 
of a simultaneous fit with lower /-value species. However, it 
does yield an accurate spectral profile to which noise can be 
added artificially. We add noise from a Gaussian distribu- 
tion and simultaneously create the corresonding sigma error 
array which is used by the D-index algorithm in determining 
the velocity spread of the line. 

We find very little change in the D-index as a function 
of S/N in the range 10 < S/N < 100, variations are 2% for 
both the high and low D-index absorbers that we test. Only 
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Table 2. Results of Resolution Tests 



Resolution 


Recommended 


Success 


Missed 


KS 


(A FWHM) 


D-index 


Rate (%) 


DLAs (%) 


Prob. 


0.4 


5.7 


91 





0.4 


0.6 


5.4 


90 


5 


0.8 


0.8 


5.1 


90 


5 


0.8 


1.0 


4.9 


90 


5 


0.8 


1.2 


4.6 


90 


5 


0.8 


1.4 


4.4 


90 


5 


0.8 


1.6 


4.2 


90 


10 


0.8 


1.8 


4.0 


89 


16 


2 


2.0 


3.9 


89 


16 


3 


2.2 


3.8 


84 


16 


11 
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Figure 5. D-indices measured from ESI spectra with 1 A resolu- 
tion. The horizontal dashed line shows the recommended cut-off 
for this resolution from Table |21 

at S/N <5 do we see a significant change in the D-index, 
although the magnitude of the change, and its direction de- 
pend upon the structure of the absorber. Nonetheless, it 
is encouraging that the D-index technique is applicable for 
spectra of relatively low quality. 



5 DISCUSSION AND CONCLUSIONS 

We have presented a new technique for discriminating be- 
tween low and high iV(H I) absorbers based solely on high 
resolution spectroscopy of the Mg II A 2796 line. Qualita- 
tively, DLAs tend to be characterised by broad, heavily sat- 
urated Mg II, whereas sub-DLAs often have considerable 
residual flux over their extent. Quantitatively, we define the 
D-index, the ratio of Mg II A 2796 EW to velocity spread. 
DLAs tend to have high D-indices, with a cut of D > 6.3 
yielding a success rate of 90%. Previously, identification of 
DLA candidates based on EW cuts from low resolution spec- 
tra had a DLA confirmation rate of only 35%. A larger sam- 
ple of absorbers which is more representative of the column 
density distribution function (i.e. with a higher fraction of 
sub-DLAs than DLAs), may have a lower success rate de- 
pending on the performance of the D-index at low iV(H I). 
This is difficult to judge from the current sample since only 8 
of our absorbers have N(R I) < 20.3. Both of the sub-DLAs 



with high D can be marked out as 'unusual' in some way; 
Q0058+019 is known to have a very small impact parameter 
to its parent galaxy and is rare in exhibiting solar metallic- 
ity (Pettini et al. 2000) and Q0957+561B is a lensed QSO. 
Whilst we do not suggest these factors as exemptions or ex- 
planations, it once again highlights the need for better sub- 
DLA statistics. In addition, all DLAs with N(K I) > 20.5 
in our pilot sample have D > 6.3, so that pre-selection with 
the D-index includes the systems which dominate the neu- 
tral gas content. Although the column density distribution 
function evolves as a function of redshift (e.g. Prochaska, 
Herbert-Fort & Wolfe 2005; Rao, Turnshek & Nestor 2006; 
Zwaan et al. 2005) it can be approximated to single power 
law f(N) oc N a . For an index a = -1.5, less than 10% of 
the HI gas is in systems with log N(H I) ^ 20.5. Therefore, 
although using the D-index may miss some DLAs with lower 
column densities, for the purposes of pre-selecting DLAs to 
estimate the total neutral gas in DLAs, it is quite robust. 
Similarly, for any property that is weighted by iV(H I), such 
as is common practice with metallicity (e.g. Pettini et al. 
1999; Kulkarni et al 2005), the DLAs missed by D-index 
pre-selection will have a relatively minor effect. 

In order for the D-index to be a successful discriminant 
of DLAs from sub-DLAs, a spectral resolution of at least 1.5 
A FWHM (at the wavelength of Mg II A 2796) is required. 
Based on simulations which degrade the spectral resolution 
of our archival data, we have tabulated the D-index which 
yields the highest DLA return rate for a given instrumental 
FWHM. 

The distinction between DLAs and sub-DLAs encap- 
sulated by the D-index is one of spectral morphology. The 
qualitative tendency for sub-DLAs to have more extended 
velocities for their EWs with a larger number of distinct 
components was previously noted by Churchill et al. (2000) . 
Rao & Turnshek (2000) also discussed the difference in spec- 
tral morphology between high and low iV(H I) absorbers 
noting that 'the DLA systems generally appear to exhibit 
somewhat greater saturation.' The physical reason for this 
difference, which leads to the difference in D-indices in the 
two populations, is still unclear, but we speculate that it 
may be driven by impact parameter (see also Rao & Turn- 
shek 2000). In their study of a gravitationally lensed QSO, 
Ellison et al. (2004b) found that the coherence scale of weak 
(EW < 0.3 A) Mg II absorbers is ~ 2 kpc (fi A = 0.7, Q. M 
— 0.3), whilst stronger absorbers have much larger sizes. El- 
lison et al. discussed a scenario in which galactic halos were 
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composed of numerous Mg II absorbing 'clouds' whose filling 
factor was larger near the centres of galaxies. In this picture, 
a sightline intersecting at small impact parameters would 
show absorption from many Mg II regions, but within a rel- 
atively narrow velocity range. Conversely, sightlines at larger 
impact parameters would exhibit more disjointed absorption 
complexes and may have much wider velocity spreads (de- 
pending on the kinematics of the individual galaxy). This 
would lead to an anti-correlation between D-index and im- 
pact parameter. We can test this prediction with the galaxy- 
Mg II absorber sample of Churchill et al. (2000) for which 
galaxies have been identified for 16 Mg II absorbers (albeit 
with EWs typically lower than those considered in this pa- 
per). Although Churchill et al. (2000) define velocity spread 
differently from us, the ratio with EW (the analog to our 
D-index) does anti-correlate with impact parameter. If the 
HI column density also falls off with impact parameter, this 
would lead to a correlation between D-index and iV(H I) and 
qualitatively explain the distribution of points in Figure [3] 
We conclude that, based on these preliminary investi- 
gations, a combination of EW and velocity spread may be a 
powerful way to pre-select DLA candidates. Although more 
data are required to test the robustness and selection biases 
of this technique it could potentially increase the yield of 
pre-selected DLA candidates by a factor of 2-3 over current 
methods. With more data available, it will also be possible 
to extend the investigation of the D-index to other transi- 
tions which may be suitable for DLA identification, such as 
Fe II transitions longwards of ~ 2300 A. 
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